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ABSTRACT 



I^M’s Administrative Terminal Svstem (®TS) is an 
effective data collection device, but its searching capabilities are 
severely limited. Searching an ATS document line-by-line is a time 
consuming process requiring slow printout speeds. A search can 
operate only upon P,°?° lines of text at anv one time. Workinq 
storage limitations make it ne~ ssarv to maintain a quantity of 
separate ATS documents in permanent storage; each of which is brouqht 
in as needed and then returned to storage, that a series of 
duplicate searches is necessary if the total vile exceeds $, 0Q0 
lines. Storage costs as well as access limitations indicate that ATS 
work is practical only with relatively small and highly fluid files, 
unless it is used purely for data capture and text editinq functions. 
Costs can be reduced by putting data into off-line storage when act 
needed and later returning it to the svstem, but this is not very 
practical in the present system configuration. t q be accessible, da+a 
must be maintained in a rioid ATS format. Random searching cannot be 
done except in a very limited sense. T n general, this project was 
considered worthwhile as a demonstration of the potential and 
limitations of terminal systems such as ATS. (M*) 




EDO 4515 



O 



A FAUL TECHNICAL MEMORANDUM 
FTM 70-1 



U.ft. DEPARTMENT Of HEALTH. EDUCATION 
ft WELFARE 
OfflCI OF IDUCATION 
THIS DOCUMENT HAS IEEN REPAODUCEO 
EXACTLY AS A ECtfVEO FAC M THE PERSON 0 A 
ORGANIZATION ORIGINATING IT POINTS Of 
VIEW OR OPINIONS STATEO DO NOT NECCS 
SARILV REPRESENT OfflClALOFf ICE Of EOU 
CATION POSITION OR POLICY 



Searching and Editing Bibliographic 
Records Using the IBM Administrative 
Terminal System (ATS). 

By Linda Webb 
James Turner 



May 1970 



Ni 

<o 

o 

o 



b 



* 
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facilities of Syracuse University Library and the 
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INTRODUCTION 



The purpose of the FAUL Technical Memoranda (FTM) series is to 
disseminate quickly to librarians and information scientists the objectives, 
methods, procedures, analyses, conclusions, and recommendations relating to 
the performance of small projects in applied research. These projects may 
be imbedded in large long-term efforts, or they may have been undertaken to 
answer specific questions which bear upon the rational improvement of 
library procedures and services. Furthermore, such research may have been 
done wholly within a single library and by its own staff, or by the FAUL 
Central staff or committees, or by outside specialists under contract. 

The material contained within these reports is used by appropriate 
FAUL staff in planning and decision-making, as well as for preserving an 
historical public record of the technical interests of the Five Associated 
University Libraries. 

The material contained herein should not be reproduced in any form 
without written permission of the author. Publication in this series in 
no way denies an author from disseminating the information in other forms. 
Comments, corrections, and suggestions for improvement are most welcome. 

These memoranda are routinely sent to the ERIC Clearinghouse on Library 
and Information Science for announcement in its publications and in 
Research in Education. 



Coordinator of Library Systems 
Five Associated University Libraries 
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SEARCHING AND EDITING BIBLIOGRAPHIC RECORDS USING THE IBM ADMINISTRATIVE TERMINAL 

SYSTEM (ATS) 

Linda Webb and James Turner 
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BACKGROUND 

In October 1968 the Five Associated University Libraries (PAUL)— ^ committed 
resources to a series of experiments directed toward constructing a central file 
of bibliographic records in a cooperative environment. This series of experi- 
ments is called the Masfile Pilot Project* The first experiment , Masfile-I, was 
devoted to building a small file containing bibliographic records from the shelf 
lists of each library. 

Since each member library uses the Library of Congress Classification 
Schedules to organize its collections! the PAUL Systems Committee decided to se«- 
lect the first five hundred shelf list cards in LC class Z116-Z1000.5, Book Trade 
and Library Science, from each library as the data base. A Masfile Input Group 
was formed composed of cataloging personnel from each library to advise the pro- 
ject and to implement the local arrangements. The shelf-list cards were collec- 
ted and edited by a FAUL Central staff member, then forwarded to the Technical 
Information Dissemination Eureau at SUNY-Buffalo. The data were converted into 
machinable form on IBM2741 terminals connected to the Administrative Terminal 
System (ATS) running on an IBM 360/40. A printout containing approximately 1800 
entries was produced on multiple-part forms and distributed to each of the five 
libraries * 

Except in the case of Syracuse University, each member library compared 
the printout with local catalogs manually. Call numbers and quantities were added 
to citations which other member libraries submitted. These updated lists were 
mailed back to*Buffalo and appropriate changes were made to the machine file by 
ATS terminals,—' 

PROCEDURE 

At Syracuse University Library, it was decided to modify Masfile records 
directly through terminals located at Syracuse and connected by dataphone to the 
ATS processor at Buffalo, a distance of about 170 miles. By this decision the 
adaptability of ATS to bibliographic searching, remote file access, and editing, 
could be tested in a practical environment, 

In June 1969 therefore, the Technical Information Dissemination Bureau (TIDB) 
at SUNY-Buffalo transferred seven files containing the 1800 Masfile records to a 
storage area in the computer which a Syracuse terminal operator could access re- 
motely, By this means, Syracuse Library staff could access the file whenever 



*/ PAUL Includes the libraries of SUNY-Binghamton, SUNY-Buffalo, Cornell University, 
University of Rochester, and Syracuse University, 

1/ See Five Associated University Libraries Mae file Pilot Project Final Report* 

Apri* 11, 1969 for additional details, (Available from the ERIC Document 
Reproduction Service, number ED 028 801) 




-2 



the ATS was in operation at SUNY-Buffalo. The transfer process was simple: 

TIDB assigned the files to a Syracuse account number as stored messages, and 
the Syracuse terminal operator transferred the files/messages to Working and 
Permanent Storage areas within the ATS. 

These files contained a total of 35,795 lines, or approximately 887 
Permanent Storage Records (PSR). One PSR is equal to about 1500 characters in 
disc storage. Seven separate files (or ATS Documents) were needed because 
the ATS system limits the size of a Document to a maximum of 9,999 lines. The 
ATS Documents contained different quantities of lines since records of variable 
length were stored. 

The staff at Syracuse proceeded to update Masfile-I records by adding the 
local call numbers of its. holdings for books entered originally by other FAUL 
institutions. Because multiple call numbers for a single book were entered on 
a single line, 453 line entries were made containing a total of 481 call numbers. 
Excluding training time and delay in returning the updated files to TIDB, 23 
hours and 48 minutes were needed for the project. An overall average of 3.00 
minutes per entry, or 20.0 entries per hour was the measured rate for entry modi- 
fication. The "per hour" time included time used to locate the position of the 
entry within the file and the typing and insertion of the new character strings. 
Also included in the figure was the time needed to sign-on to ATS, move an ATS 
Document from Permanent to Working Storage, send and receive messages to other 
terminals, store the updated file, and sign-off. Since these were all part of 
the standard operating procedure it seemed proper to include them. Besides, 
they occupied a very small percentage of the total time. Mote costly, time 
consuming, and annoying were the frequent telephone line interruptions which 
averaged over two per day; about one interrupt for each hour connected. The staff 
worked a total of twenty-eight days on the project and made sixty-five calls. 

Since standard telephone lines were used, a line interruption meant re-initiating 
the call at the regular three-minute rate. A direct line to TIDB, or a WATS line 
would have reduced the cost; having ATS available on the Syracuse campus would have 
eliminated the problem altogether. 

THE EDITING SEQUENCE 

The original printout of Masfile-I was used as well as an alphabetical 
listing of the records, Since ATS was used as the original data capturing tech- 
nique, the order of the records in the file was operationally random; the original 
records were not sorted before conversion. The first step then was for searchers 
to check each entry using the alphabetical listing made from the ATS file against 
the Syracuse holdings. Call numbers were written on the alphabetical listing. 

The second step was to manually transfer Syracuse call numbers from items on 
the alphabetical list to same items as they appeared on the random Masfile-1 
list, (See Figure 1). 

For comparison purposes three different methods of adding the call numbers 
to the machine file were used. The "print by line number" ATS command method was 
used first, resulting in a line-by-line printout of Masfile until the location of 
the next insertion was ppproached. This method was extremely slow and inefficient. 
However, the "locate" command was soon added to the ATS command set by SUNY- 
Buffalo, and the procedure was Immediately changed, to a' second method. Using 
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"locate" and "print with line number" the process was speeded up* but it soon 
became evident that "locate" was also intolerably slow. In an ATS Document of 
4860 lines an average of twenty seconds was needed to locate an entry. An ATS 
Document of 8000 lines took over a minute. The length of the string of charac- 
ters to be located made no appreciable difference in the "locate" time. Since 
"locate" searches the entire ATS Document in working storage sequencially * its 
speed is related to the size of the Document, It was found that a Document of 
about 2000 lines was the practical limit for using the "locate"command * An 
efficient operator using "locate" and "print with line numbers" consumed an 
average of three minutes to locate the position of the entry* type the entry, 
and transmit it to the file, (See Figure 2), 

SEARCHER 'S ROULETTE 

A third searching method was developed to increase the speed of inter- 
action between the file and the operator. It was discovered that each citation 
in the list occupied an average of twenty lines. Since the line number of the 
last entry was known* it was possible to calculate the neighborhood of the first 
line number of the next entry, because each line in the ATS Document was num- 
bered consecutively. The terminal operator counted the number of citations 
from the last edited entry through the next entry to be edited and multiplied 
it by twenty, adding this product to the line number of the last edited entry. 

For example: suppose the line number of the last entrv of a Syracuse call number 

was 121; then three listings followed for which no Syracuse holdings existed 
and would therefore be skipped* but in the fourth listing a call number addition 
was needed. In this case multiplying twenty (lines per book listing) by four 
(the fourth entry away is to be modified) yields a product of eighty. Adding 
eighty to the line number of the last entry* 121* sums to 201. This sum is the 
approximate line number in the next bibliographic record to be edited. Now in- 
stead of using the "print with line number" command, which prints a minimum of 
five lines and is therefore slow* the single line print command is used which 
provides almost Instantaneous response. Line 201 provides a "neighborhood" 
reference point which can be checked against the printout (see Figure 3). 

It was found that this calculation was usually within five lines of the next 
entry location. 

Using this method* an efficient operator took an average of 1,5 minutes per 
entry, half the time needed using the "locate" and "p*int" method. A major draw- 
back of the third method, dubbed "searcher’s roulette", was the difficulty in 
finding an operator who could use it. 

Searcher’s roulette was devised by the library programmer* who had no dif- 
ficulty using it and who did all the Syracuse Hasfile updating. Attempts were 
made to train two other operators. Both were MT/ST operators. The first had a 
high school diploma and a few college credits in Business* She found the neces- 
sary calculations difficult and was unable to master the ATS commands. 

The second operator also had a high school education in Business* but she 
found the technique challenging and was eager to learn it. The project was al- 
ready completed before an attempt to train her could he arranged. However* it 
seemed likely that she could have mastered searcher's roulette in a reasonable 
time. An above-average intelligence coupled with a positive attitude toward 
machines seem to be the necessary requirements for ATS Masfile updating using 
searcher's roulette techniques. 
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OBSERVATIONS AND CONCLUSIONS 

i 

As a result of this project, the following observations and conclusions 
can be drawn concerning the use of ATS in this particular library environment: 

1« ATS is an effective data collection device, but its searching capabilities 
are severely limited: 

a, Searching an ATS Document line-by-line is a time-consuming process 

requiring slow printout speeds, 

b, A search can operate only upon 9,999 lines of text at any one time* 

c, Working Storage limitations make it necessary to maintain a quantity 

of separate ATS Documents in Permanent Storage, each of which is 
brought in as needed and then returned to storage, so that a series 
of duplicate searches is necessary if the total file exceeds 9,999 
lines. 

d % Storage costs as well as access limitations indicate that ATS work 
is practical only with relatively small and highly fluid files, 
unless it is used purely for data capture and text editing functions. 
Costs can be somewhat reduced by putting data into off-line storage 
when not needed and later returning it to the system, but this is 
not very practical in the present system configuration. 

e, To be accessible data must be maintained in a rigid ATS format. It 

must therefore be reformatted for other types of processing, but 
returned to ATS format if more ATS editing work is needed. 

f, Random searching cannot be done except in a very limited sense. 

Therefore, the searcher must have an "index" of the ATS Document 
by line number if searching is to be effective. 

g, Precise knowledge of the file structure and data location is 

mandatory for all practical purposes. 

2. Telephone charges and line interruptions more than doubled the cost per 
entry in this project, Vse of WATS or a direct line would result in sub- 
stantial savings, although line interrupts would continue to be a problem, 
especially in WATS. Investigation of a direct line cost showed that at 
least two hours daily use of the terminal would be required for justification, 

3. Long distance work of this kind presents no real problems, so long as the 
user stays informed about schedules, system changes, and the like. If data 
was to be processed other than in ATS (e,g., high-speed printer), transpor- 
tation problems could arise. 

4. The use of machine^oriented clerical personnel rather than a programmer 
would have reduced costs further. 

5. Terminal printout should be restricted as much as possible, since the 
typewriter’s effective speed is 148 words per minute* A rule of thumb of 
the project was "don’t print any more than necessary." 

6. Had the project been costed in terms of minimum time, not including ex- 
perimentation, the cost would have been $806,34. Over $500.00 of the 
total cost went to experimentation. (See Appendix), 
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A revised "Locate" command now permits a terminal user to printout 
between line ranges e.g. 75-90. This command technique was not 
available during the experiment* 

Syracuse University Library has not ruled out the use of ATS altogether, 
for it has its definite uses as a data capturing system and text editor when 
properly handled* It is being considered among other applications, for use 
in editing data captured with off-line devices. Editing of catalog data may 
be done manually from a listing produced by high speed printer from data 
captured on IBM MT/ST f s, and corrections entered into the file via ATS, 
using line numbers for access. Corrected data would then go out of the ATS 
system for further processing by other computer software systems. 

In general, this project was considered worthwhile as a demonstration of 
the potentials and limitations of terminal systems such as ATS. The splendid 
cooperation of the staff of the Technical Information Dissemination Bureau at 
SUNY-Buffalo and its director, Dr. Gerald Lazorick has been greatly appreciated 
by Syracuse, 

i 
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Co.iputer assigned item 




22t 



30a 

30b 

30c 

40t 

60b 



ATS Document line 
number tike re new 
addition ie to be made 



001231 



N 



A new record begins here 



1 



90 t None 

90a 044412791632 
lOt Oehlerts, Donald D. 

20t A study to determine the feasibility of 
establishing a cooperative technical 
procecsing program and direct transmission 
of interlibrary loans **. With a foreword by 
Le Moyne W. Anderson. 

Study to determine the feasibility of establishing 
a cooperative technical processing program 
and direct transmission of interlibrary loans 
Denver 

Association of State Institutions of Higher 
Education in Colorado 
1962 

x. 45 p. tables. 23 cm. 

A report prepared for and submitted to the Associa 
of State Institutions of Higher Education In 
Colorado. 

Holdings statement 60c Bibliography: p. 38-39. 

70t Librarians — Colorado. 

70t Cataloging. Cooperative. 

70t Inter-library loans. 

72 1 Association of State Institutions of Higher 
Education in Colorado. 

73t A cooperative technical processing program and 
direct transmission of inter-library loans. 

80b NB1SU-H/2732C603/ (1) 732C 6#$ j 0) 

^ Manually added holdings 

None 

044502610703 
Cessolis, Jacobus de. 

The game of chesse*, by 
Reproduced in facsimile 
British museum. 

(etc.) 



for SUNT- 
Binghamton 



fat mo. 

BUK 690203001 230110HOLD 69/02/03000012300135 



90t 
90a 
lOt 
20t 

\ 

These field tags do not appear in final 
printout of the book catalog 



' statement by Syracuse 
University Library 

William Caxton. 
from a copy in the 
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FIGURE li Sample printout of unedited Masfile-1 record 
with field tags and manual addition of 
Syracuse University Library holdings statements 
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An * indicates lines typed by 
the terminal operator 



Cowmen tary 

Searcher is told which Vine 
number in the file the ATS 
routine is ready to operate 
on 

Operator types a LOCATE (1) 
command to find out if the 
character string beginning 
UBiSu-U/eta. can be found. 
Then continues this strateg 
throughout the part of the 
dialog “ 



A character string is located 
in the neighborhood of the 
line number which the searcher * 
wants to edit 

Print with line number command 
causes the computer to print 
out a bibliographic record 
preceded by field tags, followed i 
by end-o f -field indicators (-) 
and the line number in the file 
which contains the field 

After checking with a paper 
printout of the file (which did 
not contain line numbers), 
the searcher locates the line 
after which the new line entry 
is inserted, 
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NEXT NUMBER 4874 
*lNBiSU-H/Z695.1C7s45 
NOT FOUND . 

*lNBiSu-H/Z695 . 1C7 s45 
NOT FOUND 
*1068215220272 
NOT FOUND 

*lShariff, Waser, 1925 
NOT FOUND 
lNBiSu-H/Z699S47 
NOT FOUND 

*lSharp, Harold S ed. 

NOT FOUND 
*lNSyU/Z665S4 

'STRING 'NSyU/Z665S4 ' WAS FOUND ON LINES: 
1241, 1329, 1405, 1424 



*pnl250 

.SIGNAL WHEN READY 

30b Scarecrow Press- 1250 

30c 1964- 1251 

40t 759 p. illus. 22 cm.- . 1252 

60b Includes bibliographies.- 1253 

70t Information storage and retrieval — Addresses/ / 

70t Libraries — U. S. — Addresses, essays, lectures. - 

73t Title.- 1256 

80b NBiSU-H/Z699S47/(l)- 1257 

80c NBuU/Z699S47/(2H 1258 

80d N1C/Z699S53/(1)- 1259 

80e NRU/Z699S53r/(l)- 1260 

92t 010.78- 1261 

94t 64-11792-— 1262 



90t None- 

90a 068515241583- 

lOt Sharp, John R- 



1264 

1265 
+ 1266 

20t Some fundamentals of information retrieval.- 1267 
22t Some fundamentals of information retrieval - 1268 
30a New York- 1269 



*80f NSyU/Z699S47/(l)- 
_1260 

AFTER-- 80e NRU/Z699S5 
NEXT NUMBER 4875 



--MOVE-- 80f NSyU/Z699S*y 




FIGURE 2: ATS Dialog Sequence: Method 2, using 

"LOCATE" and "PRINT WITH LINE NUMBER" commands 



An * indicates lines typed by 
the terminal operator 
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Commentary 



Searcher is told which line 
number in the file the ATS 
routine is ready to operate on 

Operator instructs ATS to go 
to line 1400 and print it on r 
the terminal; the ?3t^— response 
is compared visually 'vith the 
file printout and seen to be 
beyond the line where new data 
is to be added by the operator 

After a sequence of approxi- 
mations the correct part of 
the correct record is located 
and the new data inserted by 
the AFTER — MOVE commands 




NEXT NUMBER 2151 
*1400 

73t Title .i 
*1415 

60b Bibliography: p. 131-133. - 

*1405 

90t Z696.D5199- 
*1402 

80c NBuU/Z696U55/ (4)4 
* 2.401 

80b NB1SU-H/Z695U58/ (1)- 
*1403 

94 1 17-26005 rev— 

*80f NSyU/025. 4U581/ca/ (1) - 

>*rol402 

AFTER— 80c NBuU/Z606U —MOVE— 80f NSyU/025. 4 *y 
NEXT NUMBER 2152 
*1500 

90t Z695.95.U53 19564 
*1497 

80e NRU/A106U56m/ (1)4 
*2.494 

73t Title. - 
*1495 

80b NB1SU-H/Z663.34M3/ (1)4 
*80f NSyU/ Z6621U58M2 ? (1)4 
*ml497 

AFTER— 80e NRU/A106U5 — MOVE— 80f NSyU/Z6621 *y 
NEXT NUMBER 2153 
(etc.) 




The "Y" indicates that 
the data is correct 
and the command should 
be executed 




FIGURE 3: ATS Dialog Sequence: Method 3, using 

Searcher's Roulette technique 
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APPENDIX 



MASFILE^I SYRACUSE UNIVERSITY LIBRARY PROJECT STATISTICS, 

I. STATISTICS 

Masfile-I file size 



Holdings additions made to the file by Syracuse 

lines added containing 1 call number 
lines added containing 2 call numbers 
lines added containing 3 call numbers 

Disk storage 

Quantity of lines of text per 
Permanent- Storage Record (PSR) 

Quantity of PSR’S 

Quantity of characters per PSR 

Total time used to manipulate Ma9file by 
Syracuse 
in hours 
in minutes 

time used for housekeeping 
time used for editing 

Quantity of calls made from Syracuse to TIDE 
In Buffalo including re-establishing of 
unintentional disconnects 

II s RATE TABLE 

ATS connect time 

(rate changed during project) 

PSR storage 

Long distance telephone from Syracuse to Buffalo 




RATE TABLE AND COSTS 



1,828 records 
35,793 lines of 
text 

433 lines of 
text 
426 
26 

1 



40,825 

887 

1,500 approx. 



23 hrs. 48 min. 
1,428 minutes 
81 
1,347 



65 



$2.00 per hour 

1,75 per hour 

,005 per day per PSR 

*70 for first 3 min. 

.20 for each add. min. 
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I in, costs 

| Base cost Actual cost ^ Projected cost — 



cost per min, 


$ 


.49 


.725 


.937 


cost per entry, Method II 
("Locate" plus "Print 
with Line Number") 


$ 


1.47 


2.172 


2,881 


cost per entry, Method III 
(Searcher’s Roulette) 




.735 


1,067 


1,405 



TOTAL COST $704.09 1036.00 1339.00 



1 / Base cost is what the project would have cost if everything worked 
smooti \y and excess storage was not charged. 

if Actual cost is the out-of-pocket expenses to perform the work of the project 
as described in the report. 

3/ P rojected cost would be project expenses if it were repeated under the 
revised cost structure imposed by the computer center part way through the 
project. The revised schedule reduced the connect time by about $1.00 but 
increased storage charges by $304. Moral: obtain rate guarantees for 
specific time periods before work begins. 



PROJECTED COST COMPONENTS 



Personnel 


$ 225 


Computer connect time 


55 


Disk storage charges 


472 


Long distance telephone 


367 


Data set rental 


220 


TOTAL 


$1339 
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